Sequence Package Analysis: A New Natural Language Method for Mining User-Generated Content for Mobile Uses

Authors

  • Amy Neustein
  • A. Neustein
Abstract

A. Neustein and J.A. Markowitz (eds.), Mobile Speech and Advanced Natural Language Solutions, DOI 10.1007/978-1-4614-6018-3_5, © Springer Science+Business Media New York 2013

Paradoxically, in an era when cyber-postings proliferate on the Web, much of the valuable information that can be mined from user-generated content (UGC) still eludes most mining programs. One reason this massive amount of UGC is, for all practical purposes, “lost” in cyberspace is the limitations inherent in existing approaches to natural language understanding. In this chapter, I explore how Sequence Package Analysis (SPA), a new natural-language data-mining method for text-based and spoken natural-language input, locates and extracts valuable opinion-related data buried in online postings and makes such data available to mobile users. The SPA mining method can be used with existing SLM systems to assist in both supervised and unsupervised training. This chapter demonstrates that the advantage of SPA in such contexts is twofold. First, by breaking down unconstrained, open-ended natural-language input into relevant sequence packages, SPA can streamline the process of classifying a vast number of sentences (or spoken utterances). Second, as the SPA algorithms become more robust, the process of collecting and classifying natural-language input can be automated entirely, thereby replacing human annotators with SPA-designed machine learning. Using several examples, randomly selected from the TripAdvisor website, I illustrate how SPA can render the hidden attributes of online reviews (both positive and negative) more visible to the mobile user.


Similar resources

High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences

Nowadays, high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discovering all patterns whose utility meets a user-specified minimum high utility threshold. It comprises extracting patterns that are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...


Sequence Package Analysis: A New Natural Language Understanding Method for Performing Data Mining of Help-Line Calls and Doctor-Patient Interviews

Designers of audio mining programs must confront the complexities of natural language dialog, which is replete with ambiguities, circumlocutions and ellipses. Speakers often make requests, lodge complaints, or report on problems in such roundabout ways that attempts to find a statistically probable word match between the application vocabulary and the user’s speech can yield unsatisfactory resu...


Developing a Recommendation Framework for Tourist by Mining Geo-tag Photos (Case Study Tehran District 6)

With the increasing popularity of sharing media on social networks and easier access to location technologies such as the Global Positioning System (GPS), people are more interested in sharing their own photos and videos. World Wide Web users are no longer only consumers of information but also producers of it; hence a wealth of information is available on Web 2.0 applications. The ...


Sequence Package Analysis: A New Method for Intelligent Mining of Patient Dialog, Blogs and Help-line Calls

The ambiguities, repetitions and ellipses commonly found in natural language dialog continue to hinder speech (and text) analytic mining programs that glean business intelligence data from consumer help-line calls, or extract important medical diagnostic information from doctor-patient interviews or consumer-generated health-related blogs. This poses an even greater problem when such mining pro...


Sentiment Analysis in Hindi Language: A Survey

Recent developments in web and mobile technologies, together with the growth of user-generated content in Hindi on the internet, are the motivation behind sentiment analysis research, which is growing at lightning speed. This information can prove very useful for researchers, governments, and organizations to learn what is on the public's mind and to make sound decisions. Opinion Mining or ...




Publication date: 2013